Joint Reference and Relation Extraction from Legal Documents with Enhanced Decoder Input

نویسندگان

چکیده

Abstract This paper deals with an important task in legal text processing, namely reference and relation extraction from documents, which includes two subtasks: 1) extraction; 2) determination. Motivated by the fact that subtasks are related share common information, we propose a joint learning model solves simultaneously both subtasks. Our employs Transformer-based encoder-decoder architecture non-autoregressive decoding allows relaxing sequentiality of traditional seq2seq models extracting references relations one inference step. We also method to enrich decoder input learnable meaningful information therefore, improve accuracy. Experimental results on dataset consisting 5031 documents Vietnamese 61,446 show our proposed performs better than several strong baselines achieves F 1 score 99.4% for task.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Temporal information extraction from legal documents

The aim of this paper is to analyze what kinds of temporal information can be found in different types of legal documents. In particular, it provides a comparison of different legal document types (case law, statute or transactional document) and how one can do further reasoning with the extracted temporal information.

متن کامل

Unsupervised Relation Extraction From Web Documents

The IDEX system is a prototype of an interactive dynamic Information Extraction (IE) system. A user of the system expresses an information request for a topic description which is used for an initial search in order to retrieve a relevant set of documents. On basis of this set of documents unsupervised relation extraction and clustering is done by the system. The results of these operations can...

متن کامل

Reference Line Extraction from Form Documents with Complicated Backgrounds

Form document analysis is one of the most essential tasks in document analysis and recognition. One of the most fundamental and crucial tasks is the extraction of the reference lines which are contained in almost all form documents. This paper presents an efficient methodology for the complicated grey-level form image processing. We construct a non-orthogonal wavelet with adjustable rectangle s...

متن کامل

Combining NLP Approaches for Rule Extraction from Legal Documents

Legal texts express conditions in natural language describing what is permitted, forbidden or mandatory in the context they regulate. Despite the numerous approaches tackling the problem of moving from a natural language legal text to the respective set of machine-readable conditions, results are still unsatisfiable and it remains a major open challenge. In this paper, we propose a preliminary ...

متن کامل

Catch Phrase Extraction from Legal Documents Using Deep Neural Network

This paper is based on finding and extracting important key phrases (catchphrase) from a document from which the the document can be summarized. This is important as this will reduce time consumption in summarization of documents. This work is realizedwith the help of deep neural network to train anmodel for recognizing such important key phrases based on various calculated parameters.

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Cybernetics and Information Technologies

سال: 2023

ISSN: ['1311-9702', '1314-4081']

DOI: https://doi.org/10.2478/cait-2023-0014